NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

A Scalable Optimization Algorithm for Solving the Beltway and Turnpike Problems with Uncertain Measurements

Elder, CS; Hoang, M; Ferdosi, M; Kingsford, C (May 2024, Springer Nature)

Full Text Available
Efficient Heterogeneous Meta-Learning via Channel Shuffling Modulation

Hoang, M; Kingsford, C (January 2024, OpenReview)

Full Text Available
DeepMinimizer: A Differentiable Framework for Optimizing Sequence-Specific Minimizer Schemes

https://doi.org/10.1007/978-3-031-04749-7_4

Hoang, M.; Zheng, H.; Kingsford, C. (January 2022, RECOMB 2022: Research in Computational Molecular Biology)
Pe'er, I. (Ed.)
Minimizers are k-mer sampling schemes designed to generate sketches for large sequences that preserve sufficiently long matches between sequences. Despite their widespread application, learning an effective minimizer scheme with optimal sketch size is still an open question. Most work in this direction focuses on designing schemes that work well on expectation over random sequences, which have limited applicability to many practical tools. On the other hand, several methods have been proposed to construct minimizer schemes for a specific target sequence. These methods, however, require greedy approximations to solve an intractable discrete optimization problem on the permutation space of k-mer orderings. To address this challenge, we propose: (a) a reformulation of the combinatorial solution space using a deep neural network re-parameterization; and (b) a fully differentiable approximation of the discrete objective. We demonstrate that our framework, DEEPMINIMIZER, discovers minimizer schemes that significantly outperform state-of-the-art constructions on genomic sequences.
more » « less
Full Text Available
How Much Data Is Sufficient to Learn High-Performing Algorithms? Generalization Guarantees for Data-Driven Algorithm Design

Balcan, M. F.; DeBlasio, D.; Dick, T.; Kingsford, C.; Sandholm, T.; Vitercik, E. (January 2021, STOC Annual Conference)
null (Ed.)
Full Text Available
How Much Data Is Sufficient to Learn High-Performing Algorithms? Generalization Guarantees for Data-Driven Algorithm Design

https://doi.org/10.1145/3406325.3451036

Balcan, M-F.; DeBlasio, D.; Dick, T.; Kingsford, C.; Sandholm, T.; Vitercik, E. (January 2021, STOC annual conference)

Full Text Available
How much data is sufficient to learn high-performing algorithms? Generalization guarantees for data-driven algorithm design

Balcan, M.-F.; DeBlasio, D.; Dick, T.; Kingsford, C.; Sandholm, T.; Viterick, E. (January 2021, STOC 2021: Proceedings of the 53rd Annual ACM SIGACT Symposium on Theory of Computing)

Algorithms often have tunable parameters that impact performance metrics such as runtime and solution quality. For many algorithms used in practice, no parameter settings admit meaningful worst-case bounds, so the parameters are made available for the user to tune. Alternatively, parameters may be tuned implicitly within the proof of a worst-case guarantee. Worst-case instances, however, may be rare or nonexistent in practice. A growing body of research has demonstrated that data-driven algorithm design can lead to significant improvements in performance. This approach uses a training set of problem instances sampled from an unknown, application-specific distribution and returns a parameter setting with strong average performance on the training set.
more » « less
Full Text Available
How much data is sufficient to learn high-performing algorithms?

Balcan, M.F.; DeBlasio, D.; Dick, T.; Kingsford, C.; Sandholm, T.; Vitercik, E. (January 2019, not applicable - unpublished manuscript)

Full Text Available

Search for: All records